Population Health Convergence: China, India, and Japan

Author

Jinhao, Jonathan, Tiffany, Francisco

1. Introduction

We study whether China and India are converging toward Japan in key population health indicators between 1993 and 2023. We use World Bank World Development Indicators data on life expectancy at birth, under-5 mortality, and adolescent fertility for China, India, and Japan.

2. Data description

2.1 Data source

We use data from the World Development Indicators (WDI) compiled by the World Bank. Our analysis focuses on three countries in East and South Asia: China (CHN), India (IND), and Japan (JPN). For each country we extract annual data from 1993 to 2023, giving us a 31-year panel. The raw data were downloaded as CSV files from the WDI interface and then imported into a SQLite database for cleaning and analysis.

2.2 Variables and units

We focus on three indicators that capture different dimensions of population health:

Life expectancy at birth, total (years) (SP.DYN.LE00.IN): the average number of years a newborn is expected to live if current mortality patterns continue.

Under-5 mortality rate (per 1,000 live births) (SH.DYN.MORT): the probability that a child born in a given year will die before reaching age 5, expressed per 1,000 live births.

Adolescent fertility rate (births per 1,000 women ages 15–19) (SP.ADO.TFRT): the annual number of births to women ages 15–19 per 1,000 women in that age group.

Life expectancy increases when population health improves, while under-5 mortality and adolescent fertility are expected to decline as health and socioeconomic conditions improve. Together, these three indicators allow us to study convergence in both survival and reproductive health.

2.3 Data processing and database structure

Code
erDiagram
    COUNTRIES {
        string country_code PK
        string country_name
    }

    INDICATORS {
        string indicator_code PK
        string indicator_name
    }

    WDI_VALUES {
        string country_code FK
        string indicator_code FK
        int year
        float value
    }

    COUNTRIES ||--o{ WDI_VALUES : has
    INDICATORS ||--o{ WDI_VALUES : measured_as

erDiagram
    COUNTRIES {
        string country_code PK
        string country_name
    }

    INDICATORS {
        string indicator_code PK
        string indicator_name
    }

    WDI_VALUES {
        string country_code FK
        string indicator_code FK
        int year
        float value
    }

    COUNTRIES ||--o{ WDI_VALUES : has
    INDICATORS ||--o{ WDI_VALUES : measured_as

2.4 Descriptive statistics

Code
import sqlite3
import pandas as pd

conn = sqlite3.connect("data/wdi.db")

stats = pd.read_sql_query("""
SELECT country_code, indicator_code,
MIN(value) AS min_val,
MAX(value) AS max_val,
AVG(value) AS avg_val
FROM wdi_values
GROUP BY country_code, indicator_code;
""", conn)

conn.close()
stats
country_code indicator_code min_val max_val avg_val
0 CHN SH.DYN.MORT 6.200000 50.800 23.167742
1 CHN SP.ADO.TFRT 4.673000 13.469 8.590581
2 CHN SP.DYN.LE00.IN 69.492000 78.202 74.648935
3 IND SH.DYN.MORT 27.700000 116.100 67.029032
4 IND SP.ADO.TFRT 14.060000 136.001 64.571935
5 IND SP.DYN.LE00.IN 59.823000 72.003 66.005645
6 JPN SH.DYN.MORT 2.400000 6.000 3.700000
7 JPN SP.ADO.TFRT 1.737000 6.225 4.302323
8 JPN SP.DYN.LE00.IN 79.303659 84.560 82.366397

Overall, the descriptive statistics already suggest large differences in population health across the three countries. Japan has the best outcomes on all three indicators: its average under-5 mortality rate is only about 3–4 deaths per 1,000 live births and adolescent fertility averages around 4 births per 1,000 girls, while life expectancy averages more than 82 years. India sits at the other extreme, with average under-5 mortality around 67 deaths per 1,000 and adolescent fertility above 60, and life expectancy roughly 66 years. China is between the two, but clearly closer to Japan than to India: under-5 mortality and adolescent fertility are much lower than in India, and average life expectancy is about 75 years. The wide gaps between minimum and maximum values, especially for India, indicate that all three countries experienced substantial improvements over the 1993–2023 period.

3. Data analysis(Methods)

In this section we describe how we assess convergence. First, we reshape the World Development Indicators data into a long panel and store it in a SQLite database with one table (wdi_values) containing country, indicator, year, and value. We use SQL queries to compute basic descriptive statistics by country and indicator. Then we plot time-series trends for each indicator and country to visualise how life expectancy, under-5 mortality, and adolescent fertility evolve between 1993 and 2023.

To study convergence more directly, we compute yearly gaps between China and Japan and between India and Japan for each indicator (for example, the difference in life expectancy between China and Japan in a given year) and plot these gaps over time. A gap that moves closer to zero indicates that a country is catching up to Japan. Finally, we summarise convergence quantitatively by fitting simple linear trends of each indicator on year for each country using least-squares lines (via np.polyfit). We interpret the slope as the average yearly change in that indicator. Comparing these slopes across countries and indicators helps us see where convergence is fastest (large positive slopes for life expectancy, large negative slopes for under-5 mortality and adolescent fertility).

4. Results

4.2 Convergence in gaps vs Japan

4.2.1 Life expectancy gap vs Japan

Code
# assumes 'le' was already defined in the previous chunk
# le = df[df["indicator_code"] == "SP.DYN.LE00.IN"]

# Japan series
le_jpn = le[le["country_code"] == "JPN"][["year", "value"]].rename(
    columns={"value": "LE_JPN"}
)

# Build gaps for CHN and IND vs JPN
gaps = []
for country in ["CHN", "IND"]:
    sub = le[le["country_code"] == country][["year", "value"]].rename(
        columns={"value": f"LE_{country}"}
    )
    merged = sub.merge(le_jpn, on="year")
    merged["gap"] = merged[f"LE_{country}"] - merged["LE_JPN"]
    merged["country_code"] = country
    gaps.append(merged[["year", "country_code", "gap"]])

gaps_df = pd.concat(gaps, ignore_index=True)

# Plot the gaps
plt.figure()
for c in ["CHN", "IND"]:
    sub = gaps_df[gaps_df["country_code"] == c]
    plt.plot(sub["year"], sub["gap"], label=c)
plt.axhline(0, linestyle="--")
plt.xlabel("Year")
plt.ylabel("Life expectancy gap vs Japan (years)")
plt.title("Convergence in life expectancy relative to Japan")
plt.legend()
plt.show()

gaps_df.head()

year country_code gap
0 1993 CHN -9.811659
1 1994 CHN -9.861829
2 1995 CHN -9.173098
3 1996 CHN -9.396756
4 1997 CHN -9.197146

The gap plot makes the idea of convergence more explicit. For both China and India, the life-expectancy gap relative to Japan is negative throughout the period, meaning they always lag behind Japan. However, the lines trend upward toward zero, indicating that the gaps are shrinking over time. China’s gap narrows from just under −10 years in the early 1990s to roughly −6 years by 2023, while India’s gap shrinks from around −19 to approximately −12 or −13 years. The temporary spike downward around 2020, especially for India, suggests that the pandemic temporarily widened the gap before improvements resumed.

The first few rows of the gap table help quantify the early period. In 1993–1997, China’s life expectancy is consistently about 9–10 years lower than Japan’s, with only small year-to-year changes. Over the full sample, the gradual movement of these gap values toward zero supports the conclusion that both countries are slowly converging toward Japan, but China is converging faster and remains much closer to Japan than India does by the end of the period.

4.3 Average yearly changes in indicators

Code
import numpy as np

rows = []

indicator_labels = {
    "SP.DYN.LE00.IN": "Life expectancy at birth",
    "SH.DYN.MORT": "Under-5 mortality",
    "SP.ADO.TFRT": "Adolescent fertility"
}

for ind in ["SP.DYN.LE00.IN", "SH.DYN.MORT", "SP.ADO.TFRT"]:
    for c in ["CHN", "IND", "JPN"]:
        sub = df[(df["indicator_code"] == ind) &
                 (df["country_code"] == c)].dropna().sort_values("year")
        # simple linear fit: value ≈ slope * year + intercept
        slope, intercept = np.polyfit(sub["year"], sub["value"], 1)
        rows.append({
            "country_code": c,
            "indicator_code": ind,
            "indicator_name": indicator_labels[ind],
            "slope_per_year": slope
        })

slopes = pd.DataFrame(rows)
slopes
country_code indicator_code indicator_name slope_per_year
0 CHN SP.DYN.LE00.IN Life expectancy at birth 0.297968
1 IND SP.DYN.LE00.IN Life expectancy at birth 0.393853
2 JPN SP.DYN.LE00.IN Life expectancy at birth 0.172199
3 CHN SH.DYN.MORT Under-5 mortality -1.578669
4 IND SH.DYN.MORT Under-5 mortality -3.059919
5 JPN SH.DYN.MORT Under-5 mortality -0.120524
6 CHN SP.ADO.TFRT Adolescent fertility 0.028329
7 IND SP.ADO.TFRT Adolescent fertility -4.886938
8 JPN SP.ADO.TFRT Adolescent fertility -0.089134

The slope estimates summarise the average yearly change in each indicator. For life expectancy, all three countries have positive slopes, with the largest gains in India, followed by China and then Japan, which is already close to the frontier. For under-5 mortality and adolescent fertility, all slopes are negative, indicating declines; the largest absolute reductions occur in India, reflecting rapid catch-up from a much higher baseline. These estimates reinforce the visual impression that convergence is strongest in child mortality, moderate in life expectancy, and weakest in adolescent fertility.

5. Discussion

Our results show clear and sustained improvements in population health for all three countries between 1993 and 2023, but from very different starting points. Japan consistently has the best outcomes: under-5 mortality is close to zero, adolescent fertility is low, and life expectancy remains above 80 years throughout. China occupies a middle position, with much better outcomes than India but still below Japan. India starts the period with high child mortality, high adolescent fertility, and the lowest life expectancy, although all three indicators improve substantially over time.

In terms of convergence, the evidence is strongest for life expectancy and under-5 mortality. The life-expectancy gap relative to Japan shrinks steadily for both China and India, with China moving from roughly 10 years behind Japan to about 6 years by 2023, and India narrowing a much larger gap from nearly 20 to around 12–13 years. Descriptive statistics (and, in additional plots, not shown here) suggest similar catch-up in child mortality: both China and India reduce under-5 mortality by large margins, with the steepest proportional declines in India. Adolescent fertility also falls in all three countries, with China approaching Japan’s low levels and India remaining higher but clearly moving in the same direction. Taken together, these patterns suggest partial convergence in survival and reproductive health, with China much closer to Japan than India by the end of the period.

The cross-country differences likely reflect broader economic and social trajectories. Japan entered the 1990s with a mature health system and demographic transition, so further gains are incremental rather than transformative. China experienced rapid economic growth, expansion of health insurance coverage, and major public-health campaigns, which plausibly support its strong catch-up in life expectancy and child survival. India also grew economically and expanded basic health services, but starting from a lower baseline and facing larger within-country inequalities, which may help explain why convergence is slower and gaps with Japan remain large, especially for adolescent fertility.

There are several limitations to our analysis. First, we rely on three national-level indicators, which cannot capture regional, rural–urban, or socioeconomic inequalities within each country. Second, our methods are descriptive: we document trends and gaps but do not attempt to identify causal effects of specific policies or shocks. Third, the WDI series may contain measurement error or revisions, particularly in earlier years. Future work could extend this project by incorporating additional health outcomes, using subnational data, and applying more formal convergence tests or regression models that adjust for economic and demographic covariates.

6. Conclusion

This project asked whether China and India are converging toward Japan in key population health indicators over the past three decades. Using WDI data on life expectancy at birth, under-5 mortality, and adolescent fertility from 1993 to 2023, we find strong evidence of improvement in all three countries and clear, though incomplete, convergence toward Japan. China has moved substantially closer to Japan, especially in life expectancy and child mortality, while India has also narrowed the gap but remains considerably behind on all indicators. Overall, the results highlight both the success of long-run health improvements in East and South Asia and the persistence of inequality in population health levels across countries, suggesting that continued investment in child health, women’s health, and broader social development will be essential for further convergence.